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Abstract 

Codon usage bias has been observed in various organisms. In this study, the correlation between SHH genes ex- 
pression in some tissues and codon usage features was analyzed by bioinformatics. We found that translational se- 
lection may act on compositional features of this set of genes. 
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Synonymous codons are codons which code for the 
same amino acid. Non-random usage of these codons is a 
widespread occurrence which has been observed in many 
organisms such as Drosophila melanogaster, E.coli and 
Saccharomyces cerevisiae (Sharp and Li, 1987; Ikemura, 
1985; Powell and Moriyama, 1997). One of the models that 
express such biases is translational selection (Zuckerkandl 
and Pauling, 1965; Ikemura, 1981; Serensen et al, 1989; 
Debry and Marzluff, 1994; Levy et al, 1996). Correlations 
between synonymous codon usage and gene expression 
levels are usually attributed to the higher abundance of 
isoaccepting tRNA for optimal codons to maximize trans- 
lational efficiency both in unicellular (Dong et al, 1996; 
Rocha, 2004) and multicellular organisms (Moriyama et 
al, 1997; Duret, 2000; Kanaya et al, 2001; Lavner and 
Kotlar, 2005). In vertebrates there has been much debate on 
this correlation (Wolfe and Sharp, 1993; Musto et al. , 200 1 ; 
Konu and Li, 2002). Moreover, this correlation is not clear 
in mammals including human beings. 

Thus, analyzing codon usage bias in specific tissues 
and specific subsets of genes is an interesting area of inves- 
tigation. The Sonic Hedgehog (SHH) pathway is one of the 
most important developmental pathways which is con- 
served from flies to human. The proteins that are involved 
in this pathway are not only critical in embryonic develop- 
ment but have also been implicated in certain cancers. In 
this study we analyzed the correlation between synony- 
mous codon usage features of SHH pathway genes and 
their expression in human tissues. The results are expected 
to provide valuable information on translational efficiency 
of this pathway in humans. By such information, analyzing 
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cancer mechanisms and causative mutations may become 
more expedient. 

SHH pathway genes were collected from a Hedgehog 
Signaling Pathway Database. In total 31 genes were se- 
lected from SHH signal receiving cells. The NCBI database 
was used to obtain their CDSs. To avoid statistical errors, 
the CDSs were aligned by ClustalW. Similarity scores of 
every two genes were between one and 68. Normalized ex- 
pression levels in the brain, embryonic tissue, prostate, 
ovary, testis, liver, muscle, and the eye were retreived from 
the SOURCE database. By using the FREQSQ program we 
calculated synonymous codon usage features and percent- 
age of each synonymous codon in each codon family that 
codes for the same amino acid. Statistical analysis was per- 
formed by using the MINITAB 1 3 program. Codons with p 
values below 0.01 were considered as significant features. 

After analyzing 59 synonymous codon usage features 
(all codons except termination codons, Methionine, and 
Tryptophan codons) for 31 genes in the SHH pathway, 
1 829 features were obtained. Then, by analyzing the rela- 
tionship between gene expression levels in eight tissues and 
synonymous codon usage features, 13 significant features 
were noted in four tissues (brain, ovary, testis and the eye) 
(Table 1). Among significant features, three of "synony- 
mous codon usage" were shared by two tissues. Regression 
equations were then obtained for significant features 
(Table 2). For one amino acid (L) we could find two differ- 
ent and significant codon usage features in brain and testis. 

The above analysis revealed a correlation between 
synonymous codon usage and expression level of SHH 
genes in brain, testis, ovary, and the eye. But this correla- 
tion was not observed in embryonic tissue, prostate, liver, 
and muscle. Thus, translational selection may select synon- 
ymous codons in genome sequences. Our analysis indicates 
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Table 1 - Significant features for the correlation between expression levels of SHH genes and synonymous codon usage features in human tissues. 



Tissue 


Number of expressed genes 


Average expression 


Highest expressed gene 


p value 


r s value 


Feature 


Brain 


31 


63.84 


PRECAKB 


0.001 


0.548 


CTA(L) 










0.002 


0.542 


TGT(C)* 










0.002 


0.542 


TGC(C)* 










0.002 


0.542 


CCA(P)* 


Ovary 


21 


3.905 


GPC1 


0.008 


-0.563 


ATA(I) 










0.004 


0.603 


CAC(H) 










0.004 


-0.606 


GGA(G) 


Testis 


28 


15.43 


IFT88 


0 


0.647 


TTA(L) 










0.004 


0.522 


CCA(P)* 










0.002 


0.561 


ACT(T) 










0.005 


0.519 


TGT(C)* 










0.005 


-0.519 


TGC(C)* 


Prostate 


24 


5.304 


CSNK1A 








Embryonic tissue 


22 


5.45 


GPC4 








Eye 


26 


6.231 


GSK3A and PRECAKB 


0.008 


0.509 


TCA(S) 


Liver 


20 


6.65 


CSNK1A 








Muscle 


26 


7.375 


GLI1 









*: Common features. 



that SHH gene expression may be regulated at a post- 
transcriptional level. Common features between two tis- 
sues support the assumption that the same mechanism may 
act in common pathways. For instance, codon usage bias 
can be attributed to the frequency of isoaccepting tRNAs in 
a tissue. This may also account for the herein observed cor- 
relations, because differences in relative tRNA abundance 
with a maximum range of tenfold variation have been de- 
tected in different human tissues (Ditmar et al., 2006). Fur- 
thermore, CTA and TTA, which encode leucin, are us- 

Table 2 - Regression equations for significant features. 
Tissue Equation 



Brain (4 significant features) 


EXP = 


- 49.9 + 13.7 CTA(L) 




EXP = 


- 75.6 + 2.79 TGT(C) 




EXP = 


203-2.79 TGC(C) 




EXP = 


- 89.3 + 5.26 CCA(P) 


Ovary (3significant features) 


EXP = 


6.68-0.187 ATA(I) 




EXP = 


- 1.34 + 0.0954 CAC(H) 




EXP = 


8.57-0.165 GGA(G) 


Testis (5significant features) 


EXP = 


3.54+ 1.28 TTA(L) 




EXP = 


-5.19 + 0.671 CCA(P) 




EXP = 


-13.4+ 1.03 ACT(T) 




EXP = 


-3.02 + 0.348 TGT(C) 




EXP = 


31.8-0.348 TGC(C) 


Eye (1 significant feature) 


EXP = 


1.45 + 0.310 TCA(S) 



age-biased in the brain and testis. This result indicates the 
criticality of this amino acid and its tRNAs in the two tis- 
sues. There are no significant features for synonymous 
codon usage in prostate, embryonic tissues, liver, and mus- 
cles, wherefore translational selection in these tissues may 
be absent 

The average expression in brain and testis was higher 
than in other tissues. This result indicates translational effi- 
ciency and accuracy. In fact, the evolution of codon bias in 
highly expressed genes is hypothesized to be a result of nat- 
ural selection for increased protein elongation rates (Bul- 
mer, 1999) or minimized errors in mRNA translation 
(Akashi, 1994). Furthermore, the frequency of tRNAs or 
factors which are involved in translation may be different in 
various tissues. This result has notable implications for un- 
derstanding the molecular mechanisms of tissue develop- 
ment and cancer. It is important to mention that other 
compositional features may also affect gene expression in 
selected tissues. Appropriate analysis can help in under- 
standing molecular mechanisms of gene expression and 
mRNA translation. Hence, the relationships between ge- 
nome, transcriptome and proteome may become clearer. 
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